Multiple instance learning of Calmodulin binding sites
نویسندگان
چکیده
MOTIVATION Calmodulin (CaM) is a ubiquitously conserved protein that acts as a calcium sensor, and interacts with a large number of proteins. Detection of CaM binding proteins and their interaction sites experimentally requires a significant effort, so accurate methods for their prediction are important. RESULTS We present a novel algorithm (MI-1 SVM) for binding site prediction and evaluate its performance on a set of CaM-binding proteins extracted from the Calmodulin Target Database. Our approach directly models the problem of binding site prediction as a large-margin classification problem, and is able to take into account uncertainty in binding site location. We show that the proposed algorithm performs better than the standard SVM formulation, and illustrate its ability to recover known CaM binding motifs. A highly accurate cascaded classification approach using the proposed binding site prediction method to predict CaM binding proteins in Arabidopsis thaliana is also presented. AVAILABILITY Matlab code for training MI-1 SVM and the cascaded classification approach is available on request. CONTACT [email protected] or [email protected].
منابع مشابه
pyLEMMINGS: Large Margin Multiple Instance Classification and Ranking for Bioinformatics Applications
Motivation: A major challenge in the development of machine learning based methods in computational biology is that data may not be accurately labeled due to the time and resources required for experimentally annotating properties of proteins and DNA sequences. Standard supervised learning algorithms assume accurate instancelevel labeling of training data. Multiple instance learning is a paradi...
متن کاملMBSTAR: multiple instance learning for predicting specific functional binding sites in microRNA targets
MicroRNA (miRNA) regulates gene expression by binding to specific sites in the 3'untranslated regions of its target genes. Machine learning based miRNA target prediction algorithms first extract a set of features from potential binding sites (PBSs) in the mRNA and then train a classifier to distinguish targets from non-targets. However, they do not consider whether the PBSs are functional or no...
متن کاملInteractions of calmodulin with the multiple binding sites of Cav1.2 Ca2+ channels.
Although calmodulin binding to various sites of the Cav1.2 Ca(2+) channel has been reported, the mechanism of the interaction is not fully understood. In this study we examined calmodulin binding to fragment channel peptides using a semi-quantitative pull-down assay. Calmodulin bound to the peptides with decreasing affinity order: IQ > preIQ > I-II loop > N-terminal peptide. A peptide containin...
متن کاملComputational comparison of a calcium-dependent jellyfish protein (apoaequorin) and calmodulin-cholesterol in short-term memory maintenance
Memory reconsolidation and maintenance depend on calcium channels and on calcium/calmodulin-dependent kinases regulating protein turnover in the hippocampus. Ingestion of a jellyfish protein, apoaequorin, reportedly protects and/or improves verbal learning in adults and is currently widely advertised for use by the elderly. Apoaequorin is a member of the EF-hand calcium binding family of protei...
متن کاملDifferent Learning Levels in Multiple-choice and Essay Tests: Immediate and Delayed Retention
This study investigated the effects of different learning levels, including Remember an Instance (RI), Remember a Generality (RG), and Use a Generality (UG) in multiple-choice and essay tests on immediate and delayed retention. Three-hundred pre-intermediate students participated in the study. Reading passages with multiple-choice and essay questions in different levels of learning were giv...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 28 شماره
صفحات -
تاریخ انتشار 2012